New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Stochastic tutorial added #185

Open

weisscharlesj wants to merge 7 commits into numpy:main from weisscharlesj:stochastic_tutorial

weisscharlesj commented Jun 24, 2023

Added tutorial-stochastic-simulation.ipynb tutorial which uses the NumPy random number generator to simulation various processes and solve problems. This tutorial was proposed in issue #184.

weisscharlesj added 3 commits

June 24, 2023 15:56


          Uploaded images for stochastic tutorial

bc05d6f


          Added stochastic tutorial ipynb file

27a7738

Added tutorial-stochastic-simulation.ipynb file which contains a tutorial demonstrating using NumPy random number generators to stochastically simulate a variety of processes. This tutorial was proposed in issue numpy#184.


          Update README.md

d8d5b47

Added tutorial-stochastic-simulations to README file

Mukulikaa reviewed

View reviewed changes

README.md Outdated Show resolved Hide resolved


          Update README.md

bc946ee

Co-authored-by: Mukulika <[email protected]>

Member

bsipocz commented Jun 28, 2023

Hi @weisscharlesj 👋 Thank you for the PR.

Before diving into trying to sort out the build issues, I wonder whether you're willing to agree for us to use this PR as a platform for updating our contributing guide. I've noticed a few things that are missing from the How-to-contribute, and some of those are causing the issues with the build/tests here.
This is all independent from the content of your tutorial, and I'm happy to help or do these infrastructural changes to the PR.

Author

weisscharlesj commented Jun 28, 2023

You are welcome to use this PR for whatever you need to including updates to the contribution guide.

rossbar and others added 2 commits

July 30, 2023 18:38


          MAINT: Add stochastic tutorial to toctree.

89c3bb7

This is necessary to get the sphinx build working.


          ENH: Convert stochastic tutorial to myst version.

39c80ea

numpy deleted a comment from review-notebook-app bot

rossbar added the content label

rossbar reviewed

View reviewed changes

Collaborator

rossbar left a comment

Thanks for your submission @weisscharlesj . Sorry for the slow response, it's been a busy summer so far!

I've taken the liberty of making a couple structural changes to get this into a reviewable state, namely:

Converted the .ipynb to a myst-markdown notebook (a necessary step for review)
Added the tutorial to the features toctree to fix the build errors and make the tutorial accessible on the site.

My goal in pushing these up is to get over the red-x on CI and make this reviewable - I haven't (yet) modified the content itself in any way!

I'll aim for a review of the tutorial itself ASAP. If you want to make any changes in the meantime, be sure to git pull first!

rkern reviewed

View reviewed changes

content/tutorial-stochastic-simulations.md

+              You may have guessed by looking at the returned values that `random()` produces
+              values in the 0 $\rightarrow$ 1 range, but what happens if we need values in a
+              different range?
+              We can modify these values by mutiplying them by a coefficient to increase the

Member

rkern Jul 31, 2023

Here is an opportunity to recommend using the uniform() method. It's the idiomatic way to accomplish this.

content/tutorial-stochastic-simulations.md


		+++

		## Calculating pi

Member

rkern Jul 31, 2023

Personally, I would omit this example entirely. I understand why it's included in various treatments of Monte Carlo approximation methods, but I think that everything that makes it an accessible example are exactly the reasons why one shouldn't use Monte Carlo techniques with pseudorandomness. Of course, no one actually needs to calculate pi to this rough level of accuracy at all, but even the kinds of practical problems that look enough like this one should be solved with other techniques, like Quasi-Monte Carlo, if not straight-up numerical integration.

I think we're on much firmer ground to use PRNGs when we are simulating actual stochastic processes or evaluating probability puzzles like in the other examples. The difference is that in this example, the property of the sequence that we're looking for is just (asymptotic) uniformity. PRNG sequences have that, but other sequences, like those from QMC techniques or even just grids, have that much better. The other examples also rely on independence, which PRNGs have (for practical purposes) and QMC sequences don't.

content/tutorial-stochastic-simulations.md

+              the circle a radius = 1.
+              This requires the coordinates to fall in the [-1, 1) ranges along both the $x$-
+              and $y$-axes.
+              We have no random number generator that produces values in this range, but we

Member

rkern Jul 31, 2023

Please be careful with these claims. We do indeed have a method for exactly this purpose: uniform().

content/tutorial-stochastic-simulations.md

+                  undecayed_array = np.full(t_final + 1, n)
+                  for second in range(1, t_final + 1):
+                      decays = rng.binomial(1, p=k, size=n_undecayed).sum()

Member

rkern Jul 31, 2023

What exactly did you want to show here? decays is binomial-distributed, but the idiomatic way to compute this with the binomial() method would be:

decays = rng.binomial(n_undecayed, p=k)

If you wanted to show how the binomial is constructed out of a sum of Bernoulli trials, you can do that, but I think it's confusing to use binomial() to make Bernoulli trials only to sum them up. The idiom for getting Bernoulli trials with a probability of k, and then summing up the successes looks like this:

decays = (rng.random(n_undecayed) < k).sum()

content/tutorial-stochastic-simulations.md

+                  all_unique_class = 0  # number of classrooms with students NOT sharing birthdays
+                  for classroom in range(n_classrooms):
+                      birthdays = rng.integers(0, high=365, size=class_size)

Member

rkern Jul 31, 2023

Overall, I think this tutorial is an opportunity to demonstrate good practice for passing PRNG state to functions that consume pseudorandomness. Namely, each function should take an rng=None argument and execute rng = np.random.default_rng(rng) before calling any methods. @albertcthomas has a good article on this, though we are converging on using rng as the name for the argument instead of seed.

I think that's more critical information for writing stochastic simulations than details about any particular Generator method call.

content/tutorial-stochastic-simulations.md

+              integers.
+              If we changed the number of layers to an odd number, we'd only get odd positions
+              in the result, and if we used +1/2 and -1/2 for our horizontal movement, we'd get
+              both even and odd integers.

Member

rkern Jul 31, 2023

I think you can probably omit a lot of this explanation if you just left the values as 0s and 1s and just say that 0 means the left path was taken and 1 means the right path was taken.


          Merge branch 'numpy:main' into stochastic_tutorial

817edf8

Contributor

Mukulikaa commented Oct 23, 2023

Hi, @weisscharlesj. Just a gentle ping to see if you have had the time to address the comments on the tutorial.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

content